Skip to content

Revert use_gram_newton_schulz default to False due to bug#55

Merged
JohnLangford merged 1 commit intomicrosoft:mainfrom
NoahAmsel:amsel/rollback-gram-ns
Apr 8, 2026
Merged

Revert use_gram_newton_schulz default to False due to bug#55
JohnLangford merged 1 commit intomicrosoft:mainfrom
NoahAmsel:amsel/rollback-gram-ns

Conversation

@NoahAmsel
Copy link
Copy Markdown

Reverts the default value of use_gram_newton_schulz back to False in all optimizers (Dion2, Muon, NorMuon, and DistributedOrthoBase) due to a bug in the gram Newton-Schulz implementation.

This was set to True in #54 but causes issues, so this PR rolls it back until the bug is fixed.

@JohnLangford JohnLangford merged commit dc223d5 into microsoft:main Apr 8, 2026
1 check passed
@NoahAmsel NoahAmsel deleted the amsel/rollback-gram-ns branch April 8, 2026 19:05
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants